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Abstract: We propose a game-theoretic dynamics of a population of replicating individuals. 
It consists of two parts: the standard replicator one and a migration between two different 
habitats. We consider symmetric two-player games with two evolutionarily stable strategies: 
the efficient one in which the population is in a state with a maximal payoff and the risk- 
dominant one where players are averse to risk. We show that for a large range of parameters 
of our dynamics, even if the initial conditions in both habitats are in the basin of attraction 
of the risk-dominant equilibrium (with respect to the standard replication dynamics without 
migration), in the long run most individuals play the efficient strategy. 
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1 Introduction 



The long-run behavior of interacting individuals can often be described within game-theoretic 
models. Players with different behaviors (strategies) receive payoffs interpreted as the number 
of their offspring. Maynard Smith and Price (1973) (see also Maynard Smith, 1982) introduced 
the fundamental notion of evolutionarily stable strategy. If everybody plays such a strategy, 
then a small number of mutants playing a different one is eliminated from the population. The 
dynamical interpretation of an evolutionarily stable strategy was later provided by several au- 
thors (Taylor and Jonker, 1978; Hofbauer et al, 1979; Zeeman, 1981). They proposed a system 
of differential equations, the so-called replicator equations, which describe time-evolution of 
frequencies of strategies. It is known that any evolutionarily stable strategy is an asymptot- 
ically stable stationary point of such dynamics (Weibull, 1995; Hofbauer and Sigmund, 1998 
and 2003). 

We analyze symmetric two-player games with two evolutionarily stable pure strategies and 
a unique unstable mixed Nash equilibrium. The efficient strategy (called also payoff dominant), 
when played by the whole population, results in the highest possible payoff (fitness). The risk- 
dominant one is played by individuals averse to risk. The strategy is risk-dominant if it has 
a higher expected payoff against a player playing both strategies with equal probabilities than 
the other one (the notion of the risk-dominance was introduced and thoroughly studied by 
Harsanyi and Selten (1984)). 

Typical example is that of a modified stag-hunt game, where two players choose simultane- 
ously one of two actions: either to join the stag hunt (strategy S) or to go after a hare (strategy 
H). The strategy S gives the highest possible payoff provided both players join the stag hunt 
and split the reward. However, when the other player is not loyal and switches to H, then the 
one playing S receives nothing. The strategy H yields lower but less risky payoffs with a higher 
payoff when the other player also chooses H. Such a coordination game has two pure Nash 
equilibria: the efficient one, where all players choose S, and the one in which players are averse 
to risk, and play H. Both strategies are evolutionarily stable and therefore we are faced with 
the problem of equilibrium selection - which strategy will be played in the long run? 

In the replicator dynamics, both strategies are locally asymptotically stable, with the risk- 
dominant one having a bigger basin of attraction. Here we propose a dynamical model for 
which the efficient strategy has a much larger basin of attraction than the risk-dominant one. 

We consider a large population of identical individuals who at each time step can belong 
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to one of two different nonoverlapping subpopulations or habitats which differ only by their 
replication rates. In both habitats they play the same two-player symmetric game. Our pop- 
ulation dynamics consists of two parts: the standard replicator one and a migration between 
subpopulations. Individuals are allowed to change their habitats. They move to a habitat in 
which the average payoff of their strategy is higher; they do not change their strategies. 

Migration can help the population to evolve towards an efficient equilibrium. Below we 
briefly describe the possible mechanism responsible for it. If in a subpopulation, the fraction of 
individuals playing the efficient strategy A is above its unique mixed Nash equilibrium fraction, 
then the expected payoff of A is bigger than that of B in this subpopulation, and therefore 
the subpopulation evolves to the efficient equilibrium by the replicator dynamics without any 
migration. Let us assume therefore that such fraction is below the Nash equilibrium in both 
subpopulations. Without loss of generality we assume that initial conditions are such that the 
fraction of individuals playing A is bigger in the first subpopulation than in the second one. 
Hence the expected payoff of A is bigger in the first subpopulation than in the second one, 
and the expected payoff of B is bigger in the second subpopulation than in the first one. This 
implies that a fraction of A-players in the second population will switch to the first one and 
at the same time a fraction of 5-players from the first population will switch to the second 
one - migration causes the increase of the fraction of individual of the first population playing 
A. However, any 5-player will have more offspring than any A-player (we are below a mixed 
Nash equilibrium) and this has the opposite effect on relative number of A-players in the first 
population than the migration. The asymptotic composition of the whole population depends 
on the competition between these two processes. 

In this note we derive sufficient conditions for migration and replication rates such that the 
whole population will be in the long run in a state in which most individuals occupy only one 
habitat (the first one for the above described initial conditions) and play the efficient strategy. 

In Section 2, we introduce the notation and present the class of games we consider. In 
Section 3, we propose a discrete-time model, obtain its continuous counterpart and prove our 
results. Section 4 contains a short discussion. 
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2 Replicator dynamics 



We consider two-player symmetric games with two pure strategies and two symmetric Nash 
equilibria. A payoff matrix is given by 

A B 
A a b 

U = 

Bed, 

where the ij entry, i,j = A,B, is the payoff of the first (row) player when he plays the 
strategy i and the second (column) player plays the strategy j; payoffs of the column player 
are given by the matrix transposed to U . 

An assignment of strategies to both players is a Nash equilibrium, if for each player, for a 
fixed strategy of his opponent, changing the current strategy cannot increase his payoff. 

In order to use our payoff matrix in the dynamics of reproducing species, we assume that 
all payoffs are non-negative. We also assume that a > d > c, d > b, and a + b < c + d which 
implies in particular that (A, A) and (B, B) are two strict Nash equilibria hence both A and 
B are evolutionarily stable. The last inequality means that an expected payoff of B against 
a player playing both strategies with equal probabilities is higher than that of A. We address 
the problem of equilibrium selection between the payoff-dominant (efficient) equilibrium (A, A) 
and the risk dominant (B, B). Our assumptions imply also that a £>-player receives a maximal 
payoff when he plays against another 5-player. 

As an example, consider the following payoffs: a = 4, 6 = 0,c = 2,rf = 3. The strategy profile 
(A, A) is more risky than (B, B) since from the point of view of the row player, a deviation by 
the column player in (A, A) results in a payoff loss of 4 units versus a loss of 1 in (B, B). 

In the replicator dynamics, individuals of a large population are matched many times to 
play the above described stage game. Let x be the fraction of the population playing A. The 
expected payoff of A is given by Ja = o,x + 6(1 — x) and that of B by fs = cx + d(l — x). 
A mixed Nash equilibrium strategy is such x* that the above two expected values are equal, 
hence x* = (d — b) / \d — b + a — c). In the standard replicator dynamics, the rate of change of 
x is proportional to the expected payoff of A. The corresponding differential equation reads 
(Weibull, 1995; Hofbauer and Sigmund, 1998) 
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— = x(f A -f B )=x(l-x)(x-x*). (1) 

If the initial condition x(0) < x*, then the population evolves in time to the state x — 0, 
i.e., in the long run almost all individuals will play B. If x(0) > x*, then x = 1 is an attracting 
point of the dynamics. If B is risk-dominant, then x* > 1/2, hence B has a bigger basin of 
attraction than A. 

Below we propose an evolutionary dynamics for which an efficient strategy has a much 
bigger basin of attraction than the risk-dominant one. 

3 Mixed replicator dynamics with migration 

We consider a large population of identical individuals who at each time step can belong to one 
of two different subpopulations or habitats. In both subpopulations they play the same two- 
player symmetric game. Our population dynamics consists of two parts: the standard replicator 
one which represents slow evolutionary changes and a migration between two habitats. Let S 
be a time step. We allow two subpopulations to replicate with different speeds. We assume 
that during time-step 6, a fraction 8 of the first subpopulation and a fraction kS of the second 
subpopulation plays the game and receives payoffs which are interpreted as the number of their 
offspring. Moreover, we allow a fraction of individuals to migrate to a habitat in which their 
strategies have a higher expected payoff. 

Let r\ denote the number of individuals which use the strategy s G {A, B} in the subpop- 
ulation % e {1, 2}. By /] we denote the expected payoff of the strategy s in the subpopulation 
i: 

f\ — ax + b{l — x), f B = cx + d(l — x), 
fl = ay + b(l-y), f% = cy + d(l - y), 

where 

1 2 

x = —, y=—, r 1 =r 1 A + r 1 B , r 2 = r 2 A + r 2 B ; 

x and y denote fractions of A-players in the first and second population respectively. We denote 
by a = y the fraction of the whole population in the first subpopulation, where r = r\ + r 2 is 
the total number of individuals. 
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The evolution of the number of individuals in each subpopulation is assumed to be a result 
of the replication and the migration flow. In our model, the direction and intensity of migration 
of individuals with a given strategy will be determined by the difference of the expected payoffs 
of that strategy in both habitats. Individuals will migrate to a habitat with a higher payoff. 
The evolution equations for the number of individuals playing strategy s, s G {A, B}, in habitat 
i, i G {1, 2}, have the following form: 

r\(t + 5) = R A + ^ (2) 

r B (t + 5) = R B + $ B , (3) 

r\{t + 5) = R\-^ A) (4) 

r 2 B (t + 5) = R 2 B -$ B , (5) 

where all functions on the right-hand sides are calculated at time t. 

Functions R\ describe an increase of the number of the individuals playing strategy s in the 
subpopulation i due to replication, cf. (Weibull, 1995): 

R\ = {\-S)^ + 8flr\, se{A,B}, (6) 
R 2 S = (1 - K5)r 2 s + n5f 2 r 2 , se{A,B}, (7) 

The rate of replication of the individuals playing strategy s in the first subpopulation is given 
by Sf} , and in the second subpopulation by n5f 2 . The parameter k measures the difference 
of reproduction speeds in both habitats. 

Functions $ s , s G {A, B}, are defined by 

®s = 6^(f s - f 2 )[r 2 Mfl - f 2 ) + r]e(f 2 - £)], (8) 
where 9 is the Heaviside's function, 

Functions describe changes of the numbers of the individuals playing strategy s in the 
relevant habitat due to migration. $ s will be referred to as the migration of individuals (who 
play strategy s) between the habitats. 
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Thus, if for example f\> f\, then there is a migration of individuals with strategy A from 
the second habitat to the first one: 

^A = hr 2 Af\-fl), (10) 

and since then necessarily < j\ [note that j\ — j\ — (a — b)(x — y) and f^ — fs = 
(c — d)(x — y)}, there is a migration flow of individuals with strategy B from the first habitat 
to the second one: 

*B = *yri(*)(/i-/i). (ii) 

In this case, the migration flow $^4 describes the increase of the number of individuals which 
play strategy A in the first subpopulation due to migration of the individuals playing strategy 
A in the second subpopulation. This increase is assumed to be proportional to the number 
of individuals playing strategy A in the second subpopulation and the difference of payoffs of 
this strategy in both subpopulations. The constant of proportionality is S times the rate of 
migration 7. 

The case 7 = corresponds to two separate populations which do not communicate and 
evolve independently. Our model reduces then to the standard discrete-time replicator dynamics 
(Weibull, 1995). In this case, the total number of players who use a given strategy changes 
only due to the increase or decrease of the strategy fitness, as described by functions defined 
in (6-7). 

In the absence of replication, 5 = 0, there is a conservation of the number of individuals 
playing each strategy in the whole population. This corresponds to our model assumption that 
individuals can not change their strategies but only habitats in which they live. 

For f\ > f\ we obtain from (2-5) equations for r;(t) and r(t): 



ri (t + 5) = (l- 5) n (t) + fr 1 (*)[ r ^ + r * / * + 7 ^ /1) + r ^ (/j? % (12) 



r 2 (jt + 5) = (1- K5)r 2 {t) + Sr 2 (t)[ K ^ + ^ + /^(/l /A)+^(/l fh) ^ (13) 

r 2 r 2 



r(t + 5) = (1 - 5) ri (t) + (1 - K5)r 2 (t) + 5r(t) [ a £Af\ + ^ ) + (1 - a)^fl + -fl)\ , (14) 

n n r 2 r 2 
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where all functions in square brackets depend on t. 

Now, as in the derivation of standard replicator dynamics (cf. e.g. Weibull, 1995), we 
consider the frequencies of the individuals playing the relevant strategies in both habitats 
rather than their numbers. Thus, we focus on the temporal evolution of the frequencies, x and 
y, and the relative size of the first subpopulation, a. We divide (2) by (12), (4) by (13), and 
(12) by (14). Performing the limit 5 — > we obtain the following differential equations: 



^ = x[(l - x)(f\ - f B ) + 7[(^-^ " V ^—^){f\ ~ fl) " (1 " x){fl - /I)]], (15) 
at xa a 

§ = - y)(fl - fl) + 7[(i - y){fl - fl) - {1 ^{fl - fh)]l (16) 

at 1 — a 

^ = a(l - a)[xf\ + (1 - x)f B - (yf A + (1 - y)f B )\ (17) 

+«7[^^(/A - fl) + (1 - x)(f B - f B )\ + a(l - a)(« - 1)(1 - y/i - (1 - y)f B ). 
a 

Similar equations are derived for the case f\ < f\ (since our model is symmetric with 
respect to the permutation of the subpopulations, it is enough to renumerate the relevant 
indices and redefine the parameter k). 

Assume first that f\(0) > /1(0), which is equivalent to x(0) > y(0). It follows from (2-5) 
that a fraction of A-players from the subpopulation 2 will migrate to the subpopulation 1 and 
a fraction of 5-players will migrate in the opposite direction. This will cause x to increase and 
y to decrease. However, if x(0) < x* and y(0) < x*, then f\ < f B and j\ < /|, therefore 
-B-players will have more offspring than A-players. This has the opposite effect on the relative 
number of A-players in the first subpopulation than migration. If x(0) < y(0), then migration 
takes place in the reverse directions. 

The outcome of the competition between migration and replication depends, for a given 
payoff matrix, on the relation between x(0) — y(0), 7 and k. We are interested in formulating 
sufficient conditions for the parameters of the model, for which most individuals of the whole 
population will play in the long run the efficient strategy A. We prove the following theorem. 

Theorem 
If 

7[x(0) — y(0)\ > max[^ 



d — c a — b 
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then x(t) ^t^oo 1 and y(t) -> t -oo 0. If k < (a - l)/(d- 1), then a(t) ^ t ->oc 1- 
If 

r . , . , , r n(d — b) a — c. 

7 1/ 0) - i O > max[^ S , 

a — c a — b 

then x(t) ^ t _oo and y(t) -> t -oo 1- If /c> (d - l)/(a - 1), then ^ t ^oo 0. 
Proof: 

Assume first that x(0) > y(0). From (15-16) we get the following differential inequalities: 

^>x(l-x)[/i-/i)+7(/|-/i)], (18) 

f < y(i - v)[k(# " /I) + lUl ~ /A)], (19) 
Using explicit expressions for f\ we get 

dx 

— >x(l-x)[(a-c + d-b)x + b-d + j(d-c)(x-y)], (20) 
dt 

^<y(l- y)[ K [(a - c + d - b)y + b - d] - 7 (a - b)(x - y)}, (21) 

We note that if -f(d - c)(x(0) - y(0)) > d- b then -f(d - c)(x(0) - y(0)) + b - d+ (a - c + 
d - b)x(0) > 0, i.e. dx/dt(0) > 0. 

Analogously, if 7(0 — b)(x(0) — y(0)) > k(cl — c), then 7(0 — b)(x(0) — 2/(0)) > n[(a — c + 
d — b) + b — d] > k[(cl — c + d — b)y(o) + b — d\, therefore dy/dt(0) < 0. Thus, combining both 
conditions we conclude that x(t) — y(t) is an increasing function so x(t) > y(t) for all t > 0, 
hence we may use (15-17) all the time. We get that x(t) ^t^oo 1 and y(t) ^t~*oo 0, and the 
first part of the thesis follows. Now from (17) it follows that if a — d + (k — 1)(1 — d) > 0, i.e. 
k < (a — l)/(d— 1), then a(t) ^t^oo 1- 

The second part of our Theorem, corresponding to initial conditions y(0) > x(0), can be 
proved analogously, starting from eqs. (2-5) written for the case f\(0) < /1(0) and their 
continuous counterparts. We omit details. □ 

The above conditions for k mean that the population consisting of just A-players replicates 
faster [exponentially in (a — l)t] than the one consisting of just -B-players [exponentially in 
(d — \)nt]. The same results would follow if the coefficients of the payoff matrix of the game 
played in one habitat would differ from those in the second habitat by an additive constant. 
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We have shown that for 7 and the difference of the initial conditions x(0) — y(0) satisfying 
certain inequalities, the fraction of A-players in one of the subpopulations converges to 1, 
the relative size of this subpopulation converges to 1, and therefore the fraction of the whole 
population playing A, z — r^jr = xa + y(l — a) converges to 1. Which subpopulation prevails 
in this sense depends on the initial conditions, whether x(0) > y(0) or y(0) > x(0). 

Let us note that the main result of our paper, as stated in the Theorem, is also valid in the 
special case of equal replicator rates in each habitat, i.e. when k = 1. 

4 Conclusions 

We discussed a dynamics of a single population of individuals playing a symmetric game. At 
any time, each individual resides in one of two disjoint different habitats. Differences between 
habitats (e.g. different ecology, fauna, flora, and feeding capacities) result in different speeds of 
replication. Players have knowledge of the expected payoffs from the strategy they are identified 
with in both habitats, and they choose the one with a higher expected payoff. The process of 
changing a habitat is referred to as migration. We assumed that the coefficient of migration is 
the same in both subpopulations. More general case would give another (apart of the difference 
in replicator speeds) characterization of both habitats. 

We showed that introduction of this mechanism of "attraction" by the habitat with a higher 
expected payoff in the standard replicator dynamics can help the whole population to reach 
the state in which in the long run most individuals play the efficient strategy. 

More precisely, we proved that for a given rate of migration, if the fractions of individuals 
playing the efficient strategy in both habitats are not too close to each other, then the habitat 
with a higher fraction of such players overcomes the other one in the long run. The fraction of 
individuals playing the efficient strategy tends to unity in this habitat and consequently in the 
whole population. Alternatively, we may say that the bigger the rate of migration is, larger is 
the basin of attraction of the efficient equilibrium. In particular, we showed that for a large 
range of parameters of our dynamics, even if the initial conditions in both habitats are in the 
basin of attraction of the risk-dominant equilibrium (with respect to the standard replication 
dynamics without migration), in the long run most individuals play the efficient strategy. 

The sizes of basins of attraction play an important role when deterministic dynamics is 
subject to stochastic perturbations due to mutations or players mistakes. The long-run be- 
havior of noisy evolutionary processes can be described by stochastic stability of deterministic 
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equilibria (Foster and Young, 1990). It was shown in many models that equilibria with larger 
basins of attraction (risk-dominant ones in the standard replicator dynamics) are stochastically 
stable (Harris and Fudenberg, 1992; Kandori et al, 1993); see also Robson and Vega-Redondo 
(1996), Vega-Redondo (1996) and Mi§kisz (2005) for models with randomly matched players. 
This is a consequence of the fact that the population needs more mutations to evolve from an 
equilibrium with a larger basin of attraction to the other equilibrium than in the case of the 
opposite transition. Therefore, for a low level of mutations, the population settles in the long 
run with a high probability in a state with a larger basin of attraction. 

We plan to investigate in the future an appropriate stochastic version of our model. One has 
to carefully examine basins of attraction of both equilibria. The result of our paper suggests 
that the efficient equilibrium would be stochastically stable. 
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